Overview

Dataset statistics

Number of variables: 15
Number of observations: 4589
Missing cells: 12850
Missing cells (%): 18.7%
Total size in memory: 2.6 MiB
Average record size in memory: 599.0 B
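
As a rough cross-check of the missing-cell figures, the sketch below sums the per-variable "Missing" counts reported in the Variables section (in the order the variables appear) and divides by the total number of cells. The list name `missing_per_variable` is only an illustrative choice.

```python
# Per-variable "Missing" counts, copied from the Variables section of this
# report in order of appearance (4 count_* variables first, then the text
# variables, then quality_reliability).
missing_per_variable = [8, 8, 8, 8, 8, 8, 3172, 3172, 1428, 8, 559, 8, 4385, 28, 42]
n_observations = 4589

n_variables = len(missing_per_variable)
total_cells = n_variables * n_observations
missing_cells = sum(missing_per_variable)
missing_pct = 100 * missing_cells / total_cells

print(n_variables, missing_cells, f"{missing_pct:.1f}%")
```

The totals agree with the overview: 15 variables, 12850 missing cells, 18.7% of all cells.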

Variable types

Numeric: 5
Text: 10

Alerts

machinery_cropmgt_types has 3172 (69.1%) missing values [Missing]
machinery_cropmgt_access has 3172 (69.1%) missing values [Missing]
machinery_cropmgt_constraint has 1428 (31.1%) missing values [Missing]
constraint_fertilizers has 559 (12.2%) missing values [Missing]
constraint_organic_inputs has 4385 (95.6%) missing values [Missing]
count_infants has 3271 (71.3%) zeros [Zeros]
count_children_2_4 has 2975 (64.8%) zeros [Zeros]
count_children_5_14 has 1379 (30.1%) zeros [Zeros]
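
The alert percentages appear to be taken over all 4589 observations, including rows where the variable is missing. A minimal sketch that recomputes them from the counts above (the helper name `pct_of_rows` is hypothetical):

```python
n_observations = 4589

# (variable, count, reported percentage), copied from the alerts above.
alerts = [
    ("machinery_cropmgt_types", 3172, "69.1"),
    ("machinery_cropmgt_access", 3172, "69.1"),
    ("machinery_cropmgt_constraint", 1428, "31.1"),
    ("constraint_fertilizers", 559, "12.2"),
    ("constraint_organic_inputs", 4385, "95.6"),
    ("count_infants", 3271, "71.3"),
    ("count_children_2_4", 2975, "64.8"),
    ("count_children_5_14", 1379, "30.1"),
]


def pct_of_rows(count, n_rows=n_observations):
    """Format count as a percentage of all rows, to one decimal place."""
    return f"{100 * count / n_rows:.1f}"


# Collect any alert whose recomputed percentage differs from the report.
mismatches = [
    (name, pct_of_rows(count), reported)
    for name, count, reported in alerts
    if pct_of_rows(count) != reported
]
print(mismatches)
```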

Reproduction

Analysis started: 2023-10-16 20:54:23.819104
Analysis finished: 2023-10-16 20:54:24.686162
Duration: 0.87 seconds
Software version: ydata-profiling v4.6.0
Download configuration: config.json
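
The reported duration follows from the two timestamps. A small sketch using only the standard library:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-10-16 20:54:23.819104", FMT)
finished = datetime.strptime("2023-10-16 20:54:24.686162", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```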

Variables

count_infants
Real number (ℝ)

ZEROS 

Distinct: 8
Distinct (%): 0.2%
Missing: 8
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.3246016154
Minimum: 0
Maximum: 7
Zeros: 3271
Zeros (%): 71.3%
Negative: 0
Negative (%): 0.0%
Memory size: 36.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 1
95-th percentile: 1
Maximum: 7
Range: 7
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.5776623192
Coefficient of variation (CV): 1.779603957
Kurtosis: 14.88415695
Mean: 0.3246016154
Median Absolute Deviation (MAD): 0
Skewness: 2.635838364
Sum: 1487
Variance: 0.333693755
Monotonicity: Not monotonic

Histogram
[Histogram with fixed size bins (bins=8)]

Most frequent values

Value      Count  Frequency (%)
0          3271   71.3%
1          1178   25.7%
2          109    2.4%
3          12     0.3%
4          5      0.1%
5          3      0.1%
7          2      < 0.1%
6          1      < 0.1%
(Missing)  8      0.2%

Lowest 5 values

Value      Count  Frequency (%)
0          3271   71.3%
1          1178   25.7%
2          109    2.4%
3          12     0.3%
4          5      0.1%

Highest 5 values

Value      Count  Frequency (%)
7          2      < 0.1%
6          1      < 0.1%
5          3      0.1%
4          5      0.1%
3          12     0.3%
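
The descriptive statistics for count_infants can be reproduced from its value counts. The sketch below is a plain-Python check, not the profiler's own code. It uses the n - 1 (sample) denominator for the variance, which is what matches the reported figure.

```python
import math

# Value -> count for count_infants, copied from the frequency table above
# (the 8 missing rows are excluded).
counts = {0: 3271, 1: 1178, 2: 109, 3: 12, 4: 5, 5: 3, 6: 1, 7: 2}

n = sum(counts.values())                          # non-missing observations
total = sum(v * c for v, c in counts.items())     # reported as "Sum"
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean                                   # coefficient of variation

print(n, total, round(mean, 4), round(variance, 4), round(cv, 3))
```

Note that n is 4581 rather than 4589 because the 8 missing rows do not enter the mean or variance.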

count_children_2_4
Real number (ℝ)

ZEROS 

Distinct: 7
Distinct (%): 0.2%
Missing: 8
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.4171578258
Minimum: 0
Maximum: 6
Zeros: 2975
Zeros (%): 64.8%
Negative: 0
Negative (%): 0.0%
Memory size: 36.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 1
95-th percentile: 2
Maximum: 6
Range: 6
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.6362093313
Coefficient of variation (CV): 1.525104629
Kurtosis: 4.247665436
Mean: 0.4171578258
Median Absolute Deviation (MAD): 0
Skewness: 1.691793613
Sum: 1911
Variance: 0.4047623133
Monotonicity: Not monotonic

Histogram
[Histogram with fixed size bins (bins=7)]

Most frequent values

Value      Count  Frequency (%)
0          2975   64.8%
1          1350   29.4%
2          219    4.8%
3          28     0.6%
4          7      0.2%
6          1      < 0.1%
5          1      < 0.1%
(Missing)  8      0.2%

Lowest 5 values

Value      Count  Frequency (%)
0          2975   64.8%
1          1350   29.4%
2          219    4.8%
3          28     0.6%
4          7      0.2%

Highest 5 values

Value      Count  Frequency (%)
6          1      < 0.1%
5          1      < 0.1%
4          7      0.2%
3          28     0.6%
2          219    4.8%

count_children_5_14
Real number (ℝ)

ZEROS 

Distinct: 10
Distinct (%): 0.2%
Missing: 8
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.336826021
Minimum: 0
Maximum: 9
Zeros: 1379
Zeros (%): 30.1%
Negative: 0
Negative (%): 0.0%
Memory size: 36.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 1
Q3: 2
95-th percentile: 4
Maximum: 9
Range: 9
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.234977631
Coefficient of variation (CV): 0.9238132801
Kurtosis: 1.915033908
Mean: 1.336826021
Median Absolute Deviation (MAD): 1
Skewness: 1.051094114
Sum: 6124
Variance: 1.525169749
Monotonicity: Not monotonic

Histogram
[Histogram with fixed size bins (bins=10)]

Most frequent values

Value      Count  Frequency (%)
0          1379   30.1%
1          1335   29.1%
2          1167   25.4%
3          464    10.1%
4          159    3.5%
5          55     1.2%
6          10     0.2%
7          7      0.2%
9          3      0.1%
8          2      < 0.1%
(Missing)  8      0.2%

Lowest 5 values

Value      Count  Frequency (%)
0          1379   30.1%
1          1335   29.1%
2          1167   25.4%
3          464    10.1%
4          159    3.5%

Highest 5 values

Value      Count  Frequency (%)
9          3      0.1%
8          2      < 0.1%
7          7      0.2%
6          10     0.2%
5          55     1.2%

count_adults
Real number (ℝ)

Distinct: 10
Distinct (%): 0.2%
Missing: 8
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.261078367
Minimum: 0
Maximum: 9
Zeros: 3
Zeros (%): 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 36.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 2
median: 3
Q3: 4
95-th percentile: 6
Maximum: 9
Range: 9
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.462019172
Coefficient of variation (CV): 0.4483238388
Kurtosis: 0.8282035906
Mean: 3.261078367
Median Absolute Deviation (MAD): 1
Skewness: 0.9598958962
Sum: 14939
Variance: 2.13750006
Monotonicity: Not monotonic

Histogram
[Histogram with fixed size bins (bins=10)]

Most frequent values

Value      Count  Frequency (%)
2          1624   35.4%
3          1051   22.9%
4          856    18.7%
5          517    11.3%
6          228    5.0%
1          171    3.7%
7          77     1.7%
8          35     0.8%
9          19     0.4%
0          3      0.1%
(Missing)  8      0.2%

Lowest 5 values

Value      Count  Frequency (%)
0          3      0.1%
1          171    3.7%
2          1624   35.4%
3          1051   22.9%
4          856    18.7%

Highest 5 values

Value      Count  Frequency (%)
9          19     0.4%
8          35     0.8%
7          77     1.7%
6          228    5.0%
5          517    11.3%
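
The quantile statistics for count_adults can be recovered from its value counts. The sketch below assumes linear interpolation between order statistics (the default in many numeric libraries); the report itself does not say which method was used, and the `quantile` helper is hypothetical.

```python
# Value -> count for count_adults, copied from the frequency table above
# (the 8 missing rows are excluded).
counts = {0: 3, 1: 171, 2: 1624, 3: 1051, 4: 856,
          5: 517, 6: 228, 7: 77, 8: 35, 9: 19}

# Expand the counts into a sorted list of individual observations.
sorted_values = [v for v in sorted(counts) for _ in range(counts[v])]


def quantile(data, q):
    """Quantile of an already sorted list, using linear interpolation."""
    pos = q * (len(data) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(data) - 1)
    frac = pos - lo
    return data[lo] + frac * (data[hi] - data[lo])


summary = {q: quantile(sorted_values, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95)}
iqr = summary[0.75] - summary[0.25]
print(summary, iqr)
```

Under that assumption the results agree with the reported 5-th percentile (2), Q1 (2), median (3), Q3 (4), 95-th percentile (6) and IQR (2).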

Distinct: 80
Distinct (%): 1.7%
Missing: 8
Missing (%): 0.2%
Memory size: 307.8 KiB

Length

Max length: 40
Median length: 9
Mean length: 11.72888016
Min length: 3

Characters and Unicode

Total characters: 53730
Distinct characters: 24
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
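
As a rough illustration of those character properties, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. It runs on the five sample rows listed for this variable, not on the full column, and the function name `category_counts` is hypothetical. The two-letter codes correspond to the category names used in this report: Ll is Lowercase Letter, Pc is Connector Punctuation, Zs is Space Separator.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters by Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# The five sample rows shown for this variable.
sample_rows = ["oxen_pair", "oxen_pair", "hoe oxen_pair", "hoe oxen_pair", "oxen_pair"]

counts = category_counts(sample_rows)
total_characters = sum(counts.values())
print(dict(counts), total_characters)
```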

Unique

Unique: 32
Unique (%): 0.7%

Sample

1st row: oxen_pair
2nd row: oxen_pair
3rd row: hoe oxen_pair
4th row: hoe oxen_pair
5th row: oxen_pair

Value                Count  Frequency (%)
oxen_pair            3676   61.6%
motorized            796    13.3%
hoe                  783    13.1%
oxen_pair_neighbour  404    6.8%
ox_cow               90     1.5%
other                50     0.8%
donkey               50     0.8%
mechanical           46     0.8%
ox_horse             39     0.7%
ox_donkey            17     0.3%
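
The word table counts tokens rather than rows: a multi-select answer such as "hoe oxen_pair" contributes to both "hoe" and "oxen_pair", and the percentages are shares of all tokens (3676 is 61.6% of the tokens, but about 80% of the 4581 non-missing rows). The sketch below assumes tokens are produced by lower-casing and splitting on whitespace. That is an assumption about how the table was built, and the helper name `word_counts` is hypothetical.

```python
from collections import Counter

# The five sample rows shown above for this variable.
sample_rows = ["oxen_pair", "oxen_pair", "hoe oxen_pair", "hoe oxen_pair", "oxen_pair"]


def word_counts(rows):
    """Count lower-cased, whitespace-separated tokens across all rows."""
    counts = Counter()
    for row in rows:
        counts.update(row.lower().split())
    return counts


counts = word_counts(sample_rows)
total_tokens = sum(counts.values())

# Print each token with its count and its share of all tokens.
for word, count in counts.most_common():
    print(f"{word:<12}{count:>3}  {100 * count / total_tokens:.1f}%")
```

Lower-casing would also explain why variables whose sample rows read "Yes" and "No" are tabulated as "yes" and "no".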

Most occurring characters

Value              Count  Frequency (%)
o                  7263   13.5%
e                  6277   11.7%
r                  5381   10.0%
i                  5326   9.9%
_                  4630   8.6%
n                  4597   8.6%
x                  4226   7.9%
a                  4172   7.8%
p                  4080   7.6%
(space)            1382   2.6%
Other values (14)  6396   11.9%

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       47718  88.8%
Connector Punctuation  4630   8.6%
Space Separator        1382   2.6%

Most frequent character per category

Lowercase Letter
Value              Count  Frequency (%)
o                  7263   15.2%
e                  6277   13.2%
r                  5381   11.3%
i                  5326   11.2%
n                  4597   9.6%
x                  4226   8.9%
a                  4172   8.7%
p                  4080   8.6%
h                  1334   2.8%
d                  863    1.8%
Other values (12)  4199   8.8%

Connector Punctuation
Value              Count  Frequency (%)
_                  4630   100.0%

Space Separator
Value              Count  Frequency (%)
(space)            1382   100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   47718  88.8%
Common  6012   11.2%

Most frequent character per script

Latin
Value              Count  Frequency (%)
o                  7263   15.2%
e                  6277   13.2%
r                  5381   11.3%
i                  5326   11.2%
n                  4597   9.6%
x                  4226   8.9%
a                  4172   8.7%
p                  4080   8.6%
h                  1334   2.8%
d                  863    1.8%
Other values (12)  4199   8.8%

Common
Value              Count  Frequency (%)
_                  4630   77.0%
(space)            1382   23.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  53730  100.0%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
o                  7263   13.5%
e                  6277   11.7%
r                  5381   10.0%
i                  5326   9.9%
_                  4630   8.6%
n                  4597   8.6%
x                  4226   7.9%
a                  4172   7.8%
p                  4080   7.6%
(space)            1382   2.6%
Other values (14)  6396   11.9%

Distinct: 3
Distinct (%): 0.1%
Missing: 8
Missing (%): 0.2%
Memory size: 265.7 KiB

Length

Max length: 9
Median length: 2
Mean length: 2.313905261
Min length: 2

Characters and Unicode

Total characters: 10600
Distinct characters: 10
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Yes
2nd row: Yes
3rd row: Yes
4th row: Yes
5th row: No

Value      Count  Frequency (%)
no         3161   69.0%
yes        1417   30.9%
no_answer  3      0.1%

Most occurring characters

Value  Count  Frequency (%)
o      3164   29.8%
N      3161   29.8%
e      1420   13.4%
s      1420   13.4%
Y      1417   13.4%
n      6      0.1%
_      3      < 0.1%
a      3      < 0.1%
w      3      < 0.1%
r      3      < 0.1%

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       6019   56.8%
Uppercase Letter       4578   43.2%
Connector Punctuation  3      < 0.1%

Most frequent character per category

Lowercase Letter
Value  Count  Frequency (%)
o      3164   52.6%
e      1420   23.6%
s      1420   23.6%
n      6      0.1%
a      3      < 0.1%
w      3      < 0.1%
r      3      < 0.1%

Uppercase Letter
Value  Count  Frequency (%)
N      3161   69.0%
Y      1417   31.0%

Connector Punctuation
Value  Count  Frequency (%)
_      3      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   10597  > 99.9%
Common  3      < 0.1%

Most frequent character per script

Latin
Value  Count  Frequency (%)
o      3164   29.9%
N      3161   29.8%
e      1420   13.4%
s      1420   13.4%
Y      1417   13.4%
n      6      0.1%
a      3      < 0.1%
w      3      < 0.1%
r      3      < 0.1%

Common
Value  Count  Frequency (%)
_      3      100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  10600  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
o      3164   29.8%
N      3161   29.8%
e      1420   13.4%
s      1420   13.4%
Y      1417   13.4%
n      6      0.1%
_      3      < 0.1%
a      3      < 0.1%
w      3      < 0.1%
r      3      < 0.1%

machinery_cropmgt_types
Text

Distinct: 58
Distinct (%): 4.1%
Missing: 3172
Missing (%): 69.1%
Memory size: 191.5 KiB

Length

Max length: 34
Median length: 28
Mean length: 9.649258998
Min length: 5

Characters and Unicode

Total characters: 13673
Distinct characters: 20
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 31
Unique (%): 2.2%

Sample

1st row: sprayer
2nd row: sprayer
3rd row: sprayer
4th row: sprayer
5th row: sprayer

Value        Count  Frequency (%)
sprayer      899    49.9%
combiner     738    41.0%
other        41     2.3%
row_planter  35     1.9%
weeder       33     1.8%
drill        26     1.4%
reaper       25     1.4%
ripper       4      0.2%

Most occurring characters

Value              Count  Frequency (%)
r                  2764   20.2%
e                  1866   13.6%
p                  967    7.1%
a                  959    7.0%
s                  899    6.6%
y                  899    6.6%
o                  814    6.0%
n                  773    5.7%
i                  768    5.6%
b                  738    5.4%
Other values (10)  2226   16.3%

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       12650  92.5%
Uppercase Letter       604    4.4%
Space Separator        384    2.8%
Connector Punctuation  35     0.3%

Most frequent character per category

Lowercase Letter
Value             Count  Frequency (%)
r                 2764   21.8%
e                 1866   14.8%
p                 967    7.6%
a                 959    7.6%
s                 899    7.1%
y                 899    7.1%
o                 814    6.4%
n                 773    6.1%
i                 768    6.1%
b                 738    5.8%
Other values (7)  1203   9.5%

Uppercase Letter
Value             Count  Frequency (%)
C                 604    100.0%

Space Separator
Value             Count  Frequency (%)
(space)           384    100.0%

Connector Punctuation
Value             Count  Frequency (%)
_                 35     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   13254  96.9%
Common  419    3.1%

Most frequent character per script

Latin
Value             Count  Frequency (%)
r                 2764   20.9%
e                 1866   14.1%
p                 967    7.3%
a                 959    7.2%
s                 899    6.8%
y                 899    6.8%
o                 814    6.1%
n                 773    5.8%
i                 768    5.8%
b                 738    5.6%
Other values (8)  1807   13.6%

Common
Value             Count  Frequency (%)
(space)           384    91.6%
_                 35     8.4%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13673  100.0%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
r                  2764   20.2%
e                  1866   13.6%
p                  967    7.1%
a                  959    7.0%
s                  899    6.6%
y                  899    6.6%
o                  814    6.0%
n                  773    5.7%
i                  768    5.6%
b                  738    5.4%
Other values (10)  2226   16.3%

machinery_cropmgt_access
Text

Distinct: 13
Distinct (%): 0.9%
Missing: 3172
Missing (%): 69.1%
Memory size: 184.2 KiB

Length

Max length: 17
Median length: 4
Mean length: 4.369089626
Min length: 3

Characters and Unicode

Total characters: 6191
Distinct characters: 11
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3
Unique (%): 0.2%

Sample

1st row: Own
2nd row: Own
3rd row: Own
4th row: Own
5th row: Own

Value   Count  Frequency (%)
rent    920    58.0%
own     546    34.4%
borrow  110    6.9%
other   9      0.6%

Most occurring characters

Value    Count  Frequency (%)
n        1466   23.7%
e        929    15.0%
t        929    15.0%
R        920    14.9%
w        656    10.6%
O        546    8.8%
o        229    3.7%
r        229    3.7%
(space)  168    2.7%
B        110    1.8%

Most occurring categories

Value             Count  Frequency (%)
Lowercase Letter  4447   71.8%
Uppercase Letter  1576   25.5%
Space Separator   168    2.7%

Most frequent character per category

Lowercase Letter
Value  Count  Frequency (%)
n      1466   33.0%
e      929    20.9%
t      929    20.9%
w      656    14.8%
o      229    5.1%
r      229    5.1%
h      9      0.2%

Uppercase Letter
Value  Count  Frequency (%)
R      920    58.4%
O      546    34.6%
B      110    7.0%

Space Separator
Value    Count  Frequency (%)
(space)  168    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   6023   97.3%
Common  168    2.7%

Most frequent character per script

Latin
Value  Count  Frequency (%)
n      1466   24.3%
e      929    15.4%
t      929    15.4%
R      920    15.3%
w      656    10.9%
O      546    9.1%
o      229    3.8%
r      229    3.8%
B      110    1.8%
h      9      0.1%

Common
Value    Count  Frequency (%)
(space)  168    100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  6191   100.0%

Most frequent character per block

ASCII
Value    Count  Frequency (%)
n        1466   23.7%
e        929    15.0%
t        929    15.0%
R        920    14.9%
w        656    10.6%
O        546    8.8%
o        229    3.7%
r        229    3.7%
(space)  168    2.7%
B        110    1.8%

machinery_cropmgt_constraint
Text

Distinct: 45
Distinct (%): 1.4%
Missing: 1428
Missing (%): 31.1%
Memory size: 276.1 KiB

Length

Max length: 53
Median length: 17
Mean length: 17.94337235
Min length: 5

Characters and Unicode

Total characters: 56719
Distinct characters: 19
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 11
Unique (%): 0.3%

Sample

1st row: availability_area
2nd row: availability_area
3rd row: availability_area expensive other
4th row: availability_area other
5th row: availability_area other

Value              Count  Frequency (%)
availability_area  2458   63.4%
expensive          574    14.8%
availability_time  347    8.9%
no_need            325    8.4%
other              175    4.5%

Most occurring characters

Value             Count  Frequency (%)
a                 13331  23.5%
i                 9336   16.5%
l                 5610   9.9%
e                 5352   9.4%
v                 3379   6.0%
t                 3327   5.9%
_                 3130   5.5%
b                 2805   4.9%
y                 2805   4.9%
r                 2633   4.6%
Other values (9)  5011   8.8%

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       52871  93.2%
Connector Punctuation  3130   5.5%
Space Separator        718    1.3%

Most frequent character per category

Lowercase Letter
Value             Count  Frequency (%)
a                 13331  25.2%
i                 9336   17.7%
l                 5610   10.6%
e                 5352   10.1%
v                 3379   6.4%
t                 3327   6.3%
b                 2805   5.3%
y                 2805   5.3%
r                 2633   5.0%
n                 1224   2.3%
Other values (7)  3069   5.8%

Connector Punctuation
Value             Count  Frequency (%)
_                 3130   100.0%

Space Separator
Value             Count  Frequency (%)
(space)           718    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   52871  93.2%
Common  3848   6.8%

Most frequent character per script

Latin
Value             Count  Frequency (%)
a                 13331  25.2%
i                 9336   17.7%
l                 5610   10.6%
e                 5352   10.1%
v                 3379   6.4%
t                 3327   6.3%
b                 2805   5.3%
y                 2805   5.3%
r                 2633   5.0%
n                 1224   2.3%
Other values (7)  3069   5.8%

Common
Value             Count  Frequency (%)
_                 3130   81.3%
(space)           718    18.7%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  56719  100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
a                 13331  23.5%
i                 9336   16.5%
l                 5610   9.9%
e                 5352   9.4%
v                 3379   6.0%
t                 3327   5.9%
_                 3130   5.5%
b                 2805   4.9%
y                 2805   4.9%
r                 2633   4.6%
Other values (9)  5011   8.8%

Distinct: 3
Distinct (%): 0.1%
Missing: 8
Missing (%): 0.2%
Memory size: 268.3 KiB

Length

Max length: 9
Median length: 3
Mean length: 2.881248636
Min length: 2

Characters and Unicode

Total characters: 13199
Distinct characters: 10
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: Yes
2nd row: Yes
3rd row: Yes
4th row: Yes
5th row: Yes

Value      Count  Frequency (%)
yes        4030   88.0%
no         550    12.0%
no_answer  1      < 0.1%

Most occurring characters

Value  Count  Frequency (%)
e      4031   30.5%
s      4031   30.5%
Y      4030   30.5%
o      551    4.2%
N      550    4.2%
n      2      < 0.1%
_      1      < 0.1%
a      1      < 0.1%
w      1      < 0.1%
r      1      < 0.1%

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       8618   65.3%
Uppercase Letter       4580   34.7%
Connector Punctuation  1      < 0.1%

Most frequent character per category

Lowercase Letter
Value  Count  Frequency (%)
e      4031   46.8%
s      4031   46.8%
o      551    6.4%
n      2      < 0.1%
a      1      < 0.1%
w      1      < 0.1%
r      1      < 0.1%

Uppercase Letter
Value  Count  Frequency (%)
Y      4030   88.0%
N      550    12.0%

Connector Punctuation
Value  Count  Frequency (%)
_      1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   13198  > 99.9%
Common  1      < 0.1%

Most frequent character per script

Latin
Value  Count  Frequency (%)
e      4031   30.5%
s      4031   30.5%
Y      4030   30.5%
o      551    4.2%
N      550    4.2%
n      2      < 0.1%
a      1      < 0.1%
w      1      < 0.1%
r      1      < 0.1%

Common
Value  Count  Frequency (%)
_      1      100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13199  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
e      4031   30.5%
s      4031   30.5%
Y      4030   30.5%
o      551    4.2%
N      550    4.2%
n      2      < 0.1%
_      1      < 0.1%
a      1      < 0.1%
w      1      < 0.1%
r      1      < 0.1%

constraint_fertilizers
Text

Distinct: 119
Distinct (%): 3.0%
Missing: 559
Missing (%): 12.2%
Memory size: 319.3 KiB

Length

Max length: 48
Median length: 44
Mean length: 19.64987593
Min length: 5

Characters and Unicode

Total characters: 79189
Distinct characters: 19
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 50
Unique (%): 1.2%

Sample

1st row: expensive availability
2nd row: expensive availability
3rd row: expensive availability other
4th row: availability expensive other
5th row: expensive

Value          Count  Frequency (%)
expensive      3796   48.4%
availability   2767   35.3%
types          447    5.7%
other          287    3.7%
transport      252    3.2%
far            178    2.3%
no_constraint  118    1.5%

Most occurring characters

Value             Count  Frequency (%)
i                 12215  15.4%
e                 12122  15.3%
a                 8849   11.2%
v                 6563   8.3%
l                 5534   7.0%
s                 4613   5.8%
p                 4495   5.7%
n                 4402   5.6%
t                 4241   5.4%
(space)           3815   4.8%
Other values (9)  12340  15.6%

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       75256  95.0%
Space Separator        3815   4.8%
Connector Punctuation  118    0.1%

Most frequent character per category

Lowercase Letter
Value             Count  Frequency (%)
i                 12215  16.2%
e                 12122  16.1%
a                 8849   11.8%
v                 6563   8.7%
l                 5534   7.4%
s                 4613   6.1%
p                 4495   6.0%
n                 4402   5.8%
t                 4241   5.6%
x                 3796   5.0%
Other values (7)  8426   11.2%

Space Separator
Value             Count  Frequency (%)
(space)           3815   100.0%

Connector Punctuation
Value             Count  Frequency (%)
_                 118    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   75256  95.0%
Common  3933   5.0%

Most frequent character per script

Latin
Value             Count  Frequency (%)
i                 12215  16.2%
e                 12122  16.1%
a                 8849   11.8%
v                 6563   8.7%
l                 5534   7.4%
s                 4613   6.1%
p                 4495   6.0%
n                 4402   5.8%
t                 4241   5.6%
x                 3796   5.0%
Other values (7)  8426   11.2%

Common
Value             Count  Frequency (%)
(space)           3815   97.0%
_                 118    3.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  79189  100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
i                 12215  15.4%
e                 12122  15.3%
a                 8849   11.2%
v                 6563   8.3%
l                 5534   7.0%
s                 4613   5.8%
p                 4495   5.7%
n                 4402   5.6%
t                 4241   5.4%
(space)           3815   4.8%
Other values (9)  12340  15.6%

Distinct: 3
Distinct (%): 0.1%
Missing: 8
Missing (%): 0.2%
Memory size: 264.5 KiB

Length

Max length: 9
Median length: 2
Mean length: 2.046059812
Min length: 2

Characters and Unicode

Total characters: 9373
Distinct characters: 10
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: No
2nd row: No
3rd row: Yes
4th row: Yes
5th row: No

Value      Count  Frequency (%)
no         4376   95.5%
yes        204    4.5%
no_answer  1      < 0.1%

Most occurring characters

Value  Count  Frequency (%)
o      4377   46.7%
N      4376   46.7%
e      205    2.2%
s      205    2.2%
Y      204    2.2%
n      2      < 0.1%
_      1      < 0.1%
a      1      < 0.1%
w      1      < 0.1%
r      1      < 0.1%

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       4792   51.1%
Uppercase Letter       4580   48.9%
Connector Punctuation  1      < 0.1%

Most frequent character per category

Lowercase Letter
Value  Count  Frequency (%)
o      4377   91.3%
e      205    4.3%
s      205    4.3%
n      2      < 0.1%
a      1      < 0.1%
w      1      < 0.1%
r      1      < 0.1%

Uppercase Letter
Value  Count  Frequency (%)
N      4376   95.5%
Y      204    4.5%

Connector Punctuation
Value  Count  Frequency (%)
_      1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   9372   > 99.9%
Common  1      < 0.1%

Most frequent character per script

Latin
Value  Count  Frequency (%)
o      4377   46.7%
N      4376   46.7%
e      205    2.2%
s      205    2.2%
Y      204    2.2%
n      2      < 0.1%
a      1      < 0.1%
w      1      < 0.1%
r      1      < 0.1%

Common
Value  Count  Frequency (%)
_      1      100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  9373   100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
o      4377   46.7%
N      4376   46.7%
e      205    2.2%
s      205    2.2%
Y      204    2.2%
n      2      < 0.1%
_      1      < 0.1%
a      1      < 0.1%
w      1      < 0.1%
r      1      < 0.1%

constraint_organic_inputs
Text

Distinct: 13
Distinct (%): 6.4%
Missing: 4385
Missing (%): 95.6%
Memory size: 151.0 KiB

Length

Max length: 33
Median length: 13
Mean length: 12.41176471
Min length: 5

Characters and Unicode

Total characters: 2532
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2
Unique (%): 1.0%

Sample

1st row: other transport
2nd row: transport other
3rd row: transport
4th row: transport other
5th row: transport other

Value          Count  Frequency (%)
no_constraint  122    50.8%
transport      57     23.8%
other          38     15.8%
expensive      23     9.6%

Most occurring characters

Value             Count  Frequency (%)
n                 446    17.6%
t                 396    15.6%
o                 339    13.4%
r                 274    10.8%
s                 202    8.0%
a                 179    7.1%
i                 145    5.7%
_                 122    4.8%
c                 122    4.8%
e                 107    4.2%
Other values (5)  200    7.9%

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       2374   93.8%
Connector Punctuation  122    4.8%
Space Separator        36     1.4%

Most frequent character per category

Lowercase Letter
Value             Count  Frequency (%)
n                 446    18.8%
t                 396    16.7%
o                 339    14.3%
r                 274    11.5%
s                 202    8.5%
a                 179    7.5%
i                 145    6.1%
c                 122    5.1%
e                 107    4.5%
p                 80     3.4%
Other values (3)  84     3.5%

Connector Punctuation
Value             Count  Frequency (%)
_                 122    100.0%

Space Separator
Value             Count  Frequency (%)
(space)           36     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   2374   93.8%
Common  158    6.2%

Most frequent character per script

Latin
Value             Count  Frequency (%)
n                 446    18.8%
t                 396    16.7%
o                 339    14.3%
r                 274    11.5%
s                 202    8.5%
a                 179    7.5%
i                 145    6.1%
c                 122    5.1%
e                 107    4.5%
p                 80     3.4%
Other values (3)  84     3.5%

Common
Value             Count  Frequency (%)
_                 122    77.2%
(space)           36     22.8%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  2532   100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
n                 446    17.6%
t                 396    15.6%
o                 339    13.4%
r                 274    10.8%
s                 202    8.0%
a                 179    7.1%
i                 145    5.7%
_                 122    4.8%
c                 122    4.8%
e                 107    4.2%
Other values (5)  200    7.9%

Distinct: 5
Distinct (%): 0.1%
Missing: 28
Missing (%): 0.6%
Memory size: 278.1 KiB

Length

Max length: 14
Median length: 6
Mean length: 5.212453409
Min length: 4

Characters and Unicode

Total characters: 23774
Distinct characters: 18
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: medium
2nd row: difficult
3rd row: easy
4th row: easy
5th row: medium

Value           Count  Frequency (%)
medium          2405   52.7%
easy            2021   44.3%
difficult       121    2.7%
very_difficult  9      0.2%
no_answer       5      0.1%

Most occurring characters

Value             Count  Frequency (%)
m                 4810   20.2%
e                 4440   18.7%
i                 2665   11.2%
d                 2535   10.7%
u                 2535   10.7%
y                 2030   8.5%
a                 2026   8.5%
s                 2026   8.5%
f                 260    1.1%
t                 130    0.5%
Other values (8)  317    1.3%

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       23760  99.9%
Connector Punctuation  14     0.1%

Most frequent character per category

Lowercase Letter
Value             Count  Frequency (%)
m                 4810   20.2%
e                 4440   18.7%
i                 2665   11.2%
d                 2535   10.7%
u                 2535   10.7%
y                 2030   8.5%
a                 2026   8.5%
s                 2026   8.5%
f                 260    1.1%
t                 130    0.5%
Other values (7)  303    1.3%

Connector Punctuation
Value             Count  Frequency (%)
_                 14     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   23760  99.9%
Common  14     0.1%

Most frequent character per script

Latin
Value             Count  Frequency (%)
m                 4810   20.2%
e                 4440   18.7%
i                 2665   11.2%
d                 2535   10.7%
u                 2535   10.7%
y                 2030   8.5%
a                 2026   8.5%
s                 2026   8.5%
f                 260    1.1%
t                 130    0.5%
Other values (7)  303    1.3%

Common
Value             Count  Frequency (%)
_                 14     100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  23774  100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
m                 4810   20.2%
e                 4440   18.7%
i                 2665   11.2%
d                 2535   10.7%
u                 2535   10.7%
y                 2030   8.5%
a                 2026   8.5%
s                 2026   8.5%
f                 260    1.1%
t                 130    0.5%
Other values (8)  317    1.3%

quality_reliability
Real number (ℝ)

Distinct: 5
Distinct (%): 0.1%
Missing: 42
Missing (%): 0.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.149109303
Minimum: 1
Maximum: 5
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 36.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 4
median: 4
Q3: 5
95-th percentile: 5
Maximum: 5
Range: 4
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.708645606
Coefficient of variation (CV): 0.1707946343
Kurtosis: -0.1353531028
Mean: 4.149109303
Median Absolute Deviation (MAD): 0
Skewness: -0.4404794942
Sum: 18866
Variance: 0.502178595
Monotonicity: Not monotonic

Histogram
[Histogram with fixed size bins (bins=5)]

Most frequent values

Value      Count  Frequency (%)
4          2328   50.7%
5          1475   32.1%
3          694    15.1%
2          47     1.0%
1          3      0.1%
(Missing)  42     0.9%

Lowest 5 values

Value      Count  Frequency (%)
1          3      0.1%
2          47     1.0%
3          694    15.1%
4          2328   50.7%
5          1475   32.1%

Highest 5 values

Value      Count  Frequency (%)
5          1475   32.1%
4          2328   50.7%
3          694    15.1%
2          47     1.0%
1          3      0.1%